Picture for Zhiqin Yang

Zhiqin Yang

Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment

Add code
May 20, 2026
Viaarxiv icon

Shaping Schema via Language Representation as the Next Frontier for LLM Intelligence Expanding

Add code
May 10, 2026
Viaarxiv icon

ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation

Add code
Apr 21, 2026
Viaarxiv icon

FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients

Add code
Mar 20, 2026
Viaarxiv icon

MemFly: On-the-Fly Memory Optimization via Information Bottleneck

Add code
Feb 08, 2026
Viaarxiv icon

MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training

Add code
Sep 26, 2025
Figure 1 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Figure 2 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Figure 3 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Figure 4 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Viaarxiv icon

LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment

Add code
Jun 13, 2025
Viaarxiv icon

HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation

Add code
Apr 01, 2025
Viaarxiv icon

Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning

Add code
Apr 26, 2024
Viaarxiv icon

Robust Training of Federated Models with Extremely Label Deficiency

Add code
Feb 22, 2024
Viaarxiv icon